Lesson: Consolidating Matching Records 


e Salvage useful data from matching records before discarding them. For example, when 
checking a driver license file against your main source file, you might pick up gender or 
date-of-birth data to add to your master record. 


e Post updated data, such as the most recent phone number, to all of the records in a match 
group. You can choose to post data to the master record, to all the subordinate members 
of the match group, or to all members of the match group. The operations you Set up in the 
Best Record operation always start with the highest priority member of the match group 
(the master) and work their way down to the last subordinate, one at a time. This ensures 
that data can be salvaged from the higher-priority record to the lower-priority record. 


Best record strategies act as a filter for taking action on other fields. There are several 
strategies to assist in setting up the best record operation quickly and easily. 


e Date—Select a date field and determine best record based on the oldest or most recent 
date. 


e Length—Select a string field and determine best record based on the shortest or longest 
string of data in the field. 


e Non Blank—Select any field and determine best record based on completeness of data in 
the field. 


e Priority Number—Select a numeric field and determine best record based on the highest 
or lowest number. 


¢ Priority String—Select a string field and determine best record based on ascending or 
descending alphabetic order for the data. 


e Custom—Base your strategy entirely on custom Python code. This allows you to open the 
Python Expression editor and create custom Python code. 


If none of these strategies fit your project needs, create a custom best record operation, using 
custom Python code. 


Note: 
The Best Record Summary report shows statistics about best record processing 


that indicate configuration settings and the results of the posting. This information 
can be used to assist in fine tuning configuration settings. 





Unique ID Generation 


A unique ID refers to a field within your data that contains a unique value that is associated 
with a record or group of records. A unique ID is to data what a social security number (SSN) 
is to a person. It creates and tracks data relationships from between multiple jobs. With the 
Unique ID operation, you can set your own starting ID during the first execution. For each 
subsequent execution, the unique ID is the next sequential value based on the existing highest 
unique ID. You can set your own starting ID for new key generation or have it dynamically 
assigned based on existing data by determining where the highest unique ID from the 
previous run ended. 


Table 53: Unique Identification Numbers First Load 
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